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Abstract 

In the last years, tens of thousands gene expression profiles for 
cells of several organisms have been monitored. Gene expression is a 
complex transcriptional process where mRNA molecules are translated 
into proteins, which control most of the cell functions. In this process, 
the correlation among genes is crucial to determine the specific func- 
tions of genes. Here, we propose a novel multi-dimensional stochastic 
approach to deal with the gene correlation phenomena. Interestingly, 
our stochastic framework suggests that the study of the gene correla- 
tion requires only one theoretical assumption -Markov property- and 
the experimental transition probability, which characterizes the gene 
correlation system. Finally, a gene expression experiment is proposed 
for future applications of the model. 

*These authors contributed equally to this work. 
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1 Introduction 



In a living organism or cell the collective behavior of thousands of genes 
and their products (for example mRNA and proteins) are embedded in a 
complex architecture that creates the mystery of life. More than 10 years 
ago, methods in molecular biology worked on a " one gene- one experiment 
framework, meaning that the throughput is constrained to only one gene 
and therefore, the whole image of gene function is difficult to visualize. An 
emerging technology called DNA microarray/GeneChips PQ 12] appeared in 
the recent years, attracting the interests among biologists, computer scien- 
tists, mathematicians and physicists. This technology allows us to monitor 
the whole transcribed genome on a single chip and offers the possibility to 
capture the correlations among thousands of expressed genes simultaneously. 

In order to understand how the organism works, it is necessary to know 
which genes are expressed, when they are expressed and how fast they do. 
Gene expression is regulated by means of the gene regulation architecture sys- 
tem of the cells, which involves network of interactions among DNA, mRNA, 
proteins and hundreds of small ubiquitous molecules. These interactions in- 
volve many elements and different and complex mechanisms, therefore an 
intuitive understanding of the underlying dynamics is not easy to obtain. 
This is also true for the issue of gene correlation, which is the main aim of 
this letter. In particular, although many approaches and techniques, as for 
example Boolean networks, graph theory and control theory, have been used 
successfully in many cases, they are still far to achieve a general description 
of the dynamics of the regulation and correlation among genes. 

To shed light on this issue, here we propose a new theoretical model to deal 
with multi-gene correlation dynamics based on only one assumption: Markov 
property . Our approach will use the most general multi-variate stochastic 
process in order to obtain predictions about the correlations among genes. 

In a previous work [3], we proposed a constructive approach to gene ex- 
pression dynamics, which re-builds the scale-free organization of genes (i.e., 
expression level k decays as a power- law fc -7 IHE]) observed in recent exper- 
iments jHl El E] • There, we proposed a stochastic approach by assuming the 
Markov property and by using the observed experimental transition prob- 
ability data, which characterize the gene expression system. Although our 
companion paper jSj succeeded to re-build the scale-free distribution, it may 
not provide much information about gene correlation phenomena because 
by construction it is one gene approach-like. Therefore, here our aim is to 
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exploit the novelty of our previous constructive approach by extending that 
one-dimensional model, to a multi-dimensional analysis (i.e., multi-gene cor- 
relation). 

Our approach is based on two fundamental aspects: Markov property and 
stochastic process. 

Markov property. If we say that the system has a Markov property, 
we mean that the future is governed by the present and does not depend on 
the past. Our model assumes Markov property to describe the multi-gene 
expression dynamics. Certainly, the living organisms are systems with long 
term memory, and they are complex systems with large number of elements 
and interactions. However, from a physical point of view, although we may 
not know all the variables in real situations, it may be enough to find a 
reduced number of variables whose behavior in time can be described as a 
Markovian process. Therefore, in our study we may assume that the most 
relevant degree of freedom of the system is the gene expression level, and 
consequently, our gene system has the Markov property. 

Stochastic process. Although many complex systems may be gov- 
erned by non-stochastic processes, in the gene expression problem the ran- 
dom variation is reasonable, plays a relevant role in cellular process, and 
furthermore stochastic noise have recently been measured and studied theo- 
retically [5J EH HU E2] • For example, the expression level of thousands genes 
is very low, which creates intrinsic uncertainties in the number of expressed 
genes in the cells jB]. Furthermore, we can even distinguish between inher- 
ent stochasticity (intrinsic noise) and external stochasticity (extrinsic noise). 
While the origin of the first one are the biochemical processes, and motivates 
that two identical genes become uncorrelated due to that randomness, the 
second one represents sources of extrinsic noise, which change from cell to cell 
(i.e., fluctuations in elements among cells) [HI El- Moreover, the number of 
molecules which are involved in signal transduction pathways fluctuates from 
10 2 to 10 4 . Therefore, the randomness connected with elementary molecular 
interactions and their amplification in the signaling cascade generates signif- 
icant spatio-temporal noise. Therefore, the stochastic approach is justified, 
and it seems more appropriate and plausible than a deterministic approach. 
Finally, it is also worth reminding that the current experimental techniques 
also provide an additional source of fluctuation, which come from the ubiq- 
uitous instrumental noise (which may be around 30% or more) from chip to 
chip with the current GeneChips technologies. 

On the other hand, one drawback of our approach is that the current ex- 
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isting experimental data of gene expression time series gene lacks of enough 
statistics. For example, in yeast ^3] and human ^5] organism experiments, 
they analyzed the fluctuations in time of many genes simultaneously (more 
than 30.000 in the case of human organism) by carrying out only one exper- 
iment 

However, in order to have enough statistics, we believe that by using many 
experiments of gene chips (i.e., many experiments measure repeatedly fluctu- 
ations of many genes in time under the same conditions), we may achieve a 
better understanding of the global nature and dynamics of gene correlation. 
Therefore, the theoretical approach proposed in this letter may be a useful 
guideline for such kind of experiments, and moreover may encourage them. 

The paper is organized as follows. Section 2 describes the theoretical 
background, explains our proposed model for multi-gene correlations and 
presents the results of our simulated data. Section 3 explains the experi- 
mental proposal compatible with our theoretical study, and finally Section 4 
presents the conclusions. 

2 Methods and Result 
2.1 Methods 

2.1.1 Markov property and Differential Chapman-Kolmogorov Equa- 
tion 

Markov property. We use multi-dimensional stochastic process for de- 
scribing the multi-gene correlation dynamics ^21 El El El- Let {X 4 = 
X^,--- ,X^),0 < t < oo} be a multi-dimensional stochastic process. For 
(t n > ■ ■ • > to), the conditional probability density function 

p(x n , t n |x n _i, t n -i; ■■■ ; x , t ) = K X *n = x n |Xt n _ 1 = x n _i; • ■ ■ ; X to = x ) 

is defined as usual manner, where x = (x 1 , • • • , x ) denotes the N dimen- 
sional vector. It is said that a multi-dimensional stochastic process has 
" Markov property" , when the condition 

|x n _l, tn-1) ' ' ' i x Cb ^o) — P( x n, £n|x„_l, <n-l) (1) 

holds for arbitrary t n > ■ ■ ■ > to. In what follows, we assume that the proba- 
bility density p(x, t|x , t ) has the time translation invariance p(x, t|x , to) = 
p(x, t + a|x , to + a ) f° r arbitrary a. 
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Our only one assumption is that the multi- dimensional correlation dynam- 
ics of gene expression obeys the Markov property. More precisely, we assume 
that the expression levels of each gene are denoted by the multi-dimensional 
stochastic process with Markov property X t . 



Differential Chapman-Kolmogorov equation. For the matter of con- 
venience, we write p(x, t) for p(x, t|x , to)- Then, if the multi-dimensional 
stochastic process has the Markov property (Eq. Q), the conditional proba- 
bility density function p(x, t |xo, to) obeys the Differential Chapman-Kolmogorov 
equation valid for a system composed of N genes and reads as follows: 



i=l *ii=l 

+ J dy[W(x\y,t)p(y,t) - W(y\x,t)p(x,t)), (2) 
where the drift term a 1 (x) is given by 

lim- / (y l - x l )T € (y,x)dy = a*(x) + 0(5), (3) 

6 J\y- X \ <S 

and the diffusion matrix 6* J '(x) reads as 

hm- / (y* - x l )(y j - ^)T £ (y,x)rfy = &«(x) + 0(5), (4) 

e J\y-x\<S 

and jump term W(y|x, t) is given by 1 

^(y|x,t) = limT £ (y,x)/e. (5) 

e— >0 

Here T e (y, x) is an Instantaneous Transition Probability (ITP) defined by 
T t (y, x) = p(y, t + e|x, t) for sufficiently small e. 

In the context of gene expression level, Eq. (J2J) represents the dynamics of 
N mRNA molecules (i.e., gene expression) in the cell. By using this equation, 
we can study the dynamics and correlation between genes i and j. 

Next, we will explain three important processes of the Differential Chapman- 
Kolmogorov equation (J2J) in the following paragraph. 

1 Here we remark that although W(x\y,t) = seems to imply that b^^x) = 0, it is 
not correct since, in general, it is not possible to exchange the order of the limit and the 
integral in Eq. Therefore, it is possible that 6 u (x) is not zero, even if W(x|y, t) = 0. 
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Deterministic process. If the diffusion matrix 6 lJ '(x) and the term W(x|y, t) 
vanish, then the Differential Chapman- Kolomogorov equation (J2J) is reduced 
to 

= -E^M^«)). (a) 

i=i 

This equation is deterministic because it does not involve any random fluctu- 
ations. Moreover, this equation is essentially equivalent to the ordinary differ- 
ential equation, and therefore, it is origin of many equations used frequently 
in biology. For example, the control theory used for analyzing chemical-taxi 
of E.coli in [20J is included in Eq. (JEJ). 

Diffusion process. In the absence of the jump term (i.e., W(x.\y,t) = 0), 
the Differential Chapman- Kolmogorov equation reads as 



N a i N Q2 



<9p(x, t) ^ d r ir I ^ 9 



dt dx i ' 2 ^— ' dx i x : > 

i=i *j=i 

which is known as a diffusion process, which will be used later. 

Jump process. In contrast, if we assume that a l (x) = 6 y '(x) = 0, the 
Differential Chapman-Kolmogorov equation takes the form 



= J dy[W(x\y, t)p(y, t) - W(y|x, t)p(x, t)], (8) 

which is known as a jump process. It means that the path trajectory of X 4 
will exhibit discontinuities (large jumps) at specific discrete points. 

It is also known that the jump processes can represent some kind of chem- 
ical reactions (Ref. |T0] ) - Therefore, we may use this equation to analyze the 
metabolic pathways j2I] in cells, which are composed of chemical reactions 
and chemical compounds. 
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How to use the Differential Chapman-Kolmogorov equation. We 

can obtain the dynamics of probability density p(x, t | x , t ) for any time t, 
from experimental data of instantaneous transition probability T e (y, x) (e is 
sufficiently small and fixed), by the following procedure: 

(i) Given the experimental data of instantaneous transition probability 
T e (y, x) (e is sufficiently small), we obtain a 4 (x), 6 lJ '(x) and W(x|y, t) 
by using Eq. (jHJ), (jU) and (j5J) respectively 2 

(ii) By inserting a J (x), & lJ (x) and VT(x|y, t) into the Differential Chapman- 
Kolmogorov equation Eq. (j2J) and by solving this PDE (Partial Differ- 
ential Equation), we can obtain useful information like the distribution, 
the expectation value, the variance and the correlation at any time. 



2.1.2 Initial instantaneous transition data T e (y,x) 

The initial instantaneous transition data T e (y,x) of N genes for studying 
correlations should be determined by the experiment, which measures the 
short-time transition probability between the N dimensional gene expression 
levels x at time t and the N dimensional gene expression levels y at time t+e. 
Although some experiments have been done for measuring gene expression 
time series of many organisms [HI E3 EL the statistics are not enough to 
completely determine the T e (y,x) and we believe that several experiments 
should be done under the same condition to have enough statistics. More- 
over, the stochastic nature of the fluctuations of the gene expression level, 
strongly supports the idea of many experiments-many genes under the same 
conditions. 

Therefore, for the time being, we assume that the expression of the initial 
instantaneous transition data T e (y, x) of N genes correlation is the Gaussian 
type, which seems general enough to illustrate our model. The expression is 
as follows; 



T e (y,x) 



v /(2vre) 7V det(a^' 
1 
2 



(9) 



exp 



-e 1 o i j(y l — x l — e//(m* — x l ))(y j — x j — efx j (m j — x j ) 



2 Notice that we do not need the whole data T c (y,x) for any e. The necessary data is 
T e (y, x) at a sufficiently small fixed e. 
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where is the inverse matrix of (i.e. ^2 k cr %k akj = #})• We note that 
the contraction of the indexes i, j is done by using the Einstein summation 
convention. 

This ITP T e (y, x) (Eq. Q) allows us to analyze the gene correlation 
phenomena and, furthermore, has the mean reverting property, which is as- 
sumed in order to be compatible with the observation in fSJE] (see Fig. 1). 
Here, we give some annotations on the parameters of T e (y, x). m l denotes 
the average expression level of genes % and fi l means the tendency to revert- 
ing to m\ Finally, cr^ indicates the correlation between gene i and j. These 
parameters will be clear to the readers in the following sections. 

2.1.3 Computation of a*(x), 6 lJ '(x) and W(x\y,t) 

In this section, we compute a l (x), 6 u '(x) and W(x\y,t) from the initial data 
of ITP T e (y,x). Inserting Eq. © into ©, © and ©, we obtain 

a ! (x)=lim- / (y i -x i )T e {y,x)dy = ^(m i -x i ) 1 (10) 

e-*0 e J| y _ x |<«5 

and 

^(x) = lim- / (y'-x^yi -xi)T e (y,x)dy = o i 1, (11) 

and 

H/(y|x,t) = hmT e (y,x)/e = 0. (12) 

Here, we remark the following. The drift term a l (x) = fi l (m % — x l ) repre- 
sents that our model has the mean value property (i.e. the gene expression 
level of gene % tends to revert the mean value m l with the quickness see 
Fig. 1). The diffusion matrix 6 lJ '(x) = denotes the correlation between 
gene i and j. Here, we remark that the correlation is constant in our model 
(Eq. (11)) for simplicity, although we could include the x dependence in the 
model in future work. Finally, the jump term W(x\y,t) = means that our 
model does not contain the jump process. 

2.1.4 Emergence of Kolmogorov equation and SPDE 

In the last section, we find out that o i (x) = {i % {m % — x l ), 6 y "(x) = ff y and 
W(x|y, t) = 0. However, in order to keep the argument more general, we 
still consider the drift term a l (x) and diffusion term 6 iJ (x) arbitrary while 
we assume that the jump term vanishes W(x|y, t) = 0. 
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Kolmogorov Equation. If VT(x|y, £) = vanishes, then the Differential 
Chapman-Kolmogorov equation (J2J) becomes the Kolmogorov equation: 



<9p(x, t) 
dt 



E ^{a'MpCM)} + 2 E ^{^w^*)}. ( 13 ) 




where p(x, £) = p(x, t|x , t ). 

SPDE. In addition, it is known that the Kolmogorov equation is equivalent 
to the following stochastic partial differential equation (SPDE): 



Here, the multi-dimensional stochastic variable X t denotes the gene expres- 
sion level, a l (x) = a'(x) denotes the average change of the instantaneous 
transition of the gene expression level per unit time, ^{x) denotes the co- 
variance of instantaneous transition of the gene expression level per unit 
time given by 6 u '(x) = ^2k=i /5 lfc (x)/3 : ' fc (x) (Here we remark that /3 l3 (x) is not 
uniquely determined from bij(x) due to the rotation group ambiguity.) and 
W(£) = (Wi(t),--- ,H / jv(£)) denotes the multi-dimensional Wiener process 
where all the processes are independent of each other dWi(t)dWj(t) = Sijdt. 
We also note that the stochastic calculus in our approach follows the Ito rule 
by construction, and not the Stratonovich rule 3 . 

Correlated-SPDE. Although the above SPDE (JHJ) is good for a theoret- 
ical study, it is difficult to analyze Eq. (|T3j) directly, since Eq. (|T3jl includes 
the multi-components processes of W\{t), ■ ■ ■ , Wx{t) and the ambiguity of 
/3 u (x) relative to the rotation group. Therefore, we transform the above 
SPDE ()14j) into a more convenient expression (i.e, where the correlation is 
explicitly manifested) as follows: 



N 




(14) 



dX\ = a\X t )dt + ftiX^dz^t), 



(15) 



3 See ^3 EH f° r further discussions on the Ito-Stratonovich dilemma. 



9 



and the correlation is given by 

dz\t)dz?(t) = p ij (x)dt, (16) 

where p{x) = ^W{xj, = 7==% , and (-1 < Pij {x) < 1). 

It is important to note that this expression is more easy to understand and 
deal with from the practical point of view, since the SPDE (|15jl has only one 
Brownian motion (fluctuations) component dz l (t), while many components 
of Brownian motion appear as dW l (t). ■ ■ ■ ,dW N (t) in ()14|) . Furthermore, 
all the correlation information among genes are gathered in dz l (t)dz° (t) = 
p^{x)dt in Eq. ()16j) . which by the way is a more easy expression to deal with. 



2.1.5 Analysis of our model 

^From general analysis of Kolmogorov equation, we return to our original 
situation where the drift a*(x) = p}{m % — x l ), the diffusion fe lJ '(x) = cr*- 7 and 
the jumping term W(x|y, t) = 0. Then, by inserting them into the SPDE 
Eq. (fT5j) and (fTB^I. we obtain 

dX\ = p i (m i - X\)dt + ^dzf, (17) 
where a % = \fa^ and the correlation is given by 

dz\t)dz j {t) = p ij dt = ^-dt. (18) 

This reduced model is also known as Vasicek model in financial engineer- 
ing (see Ref. JH]). This SPDE directly gives useful information about the 
properties of the model as follows: 

(%) Mean reverting. The model (Eq. (|17p) has mean reverting property. 
We can observe this phenomena in Fig. 1, which is obtained from data 
of gene expression time series experiments of human and yeast organism 

[13 EU. 

(ii) Multi- correlation. Eq. (|17|) also exhibits an embedded correlation re- 
lationship given by Eq. (|18p. Therefore, by using this relationship, 
we can analyze the gene correlation phenomena among genes using our 
model. 
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2.1.6 Gene expression dynamical solution 

By using Ito formula, we can solve the SPDE (Eq. ([17))) and derive the 
dynamical solution of gene expression 

X\ = rrj + (4 - m^e'^ + a 1 [ e -" i( *-"W(s), (19) 

Jo 

where x^ = X^ is the initial value of gene expression level. ^From this solu- 
tion, we can obtain the most important quantities which contain the relevant 
information of the system: expectation value and variance and covariance of 
multi-dimensional gene expression level at any time as follows: 

E[Xl) = 711' + (4 - m^e - "**, (20) 



V[Xt] = ^(1 - e" 2 ^), (21) 



and 



Cor[Xixi\ = ^-{l - e-^>). (22) 

Roughly speaking, it means that if we specify the initial state (x$ = Xq) 
(for example, a patient receives a chemical treatment which modifies the 
amount of mRNA in cells) , then we may predict the effect of that treatment 
in cells by knowing the expectation value, variance and covariance of multi- 
dimensional gene expression level at any time. 



2.2 Computation of simulated data 

In order to obtain the sample path of multi-gene correlation dynamics from 
Eq. (|T7J), we use the difference equation corresponding to Eq. (fT7|) as follows. 
For sufficiently large number n. We equally divide the time interval [0,i] by 
ti = iAt (i = 0, ■ ■ • , n and At = t/n ), then from Eq. (|17j) we obtain: 

Xf +1 = X\ + p?{m? - X{)dt + a j Az[ ] \ (23) 

where 

Xl=K and Az? = z^ - z^ ~ N n (0, p% (24) 
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where N n (0, p y ) denotes the n-dimensional normal distribution and the mean 
vector is zero and the variance matrix is given by p y . We repeatedly use Eq. 
(|2*3*j) to obtain the sample path. In Fig. 2, we show the five sample paths 
for total positive correlation p 12 = 1, slightly positive correlation p 12 = 0.5, 
no correlation p 12 = 0, slightly negative correlation p 12 = —0.5 and total 
negative correlation p 12 = — 1, respectively. 

Although the correlation p y between genes is easy to be characterized 
by the theoretical analysis of simulated data of gene expression (Eq. Q), 
this issue becomes more difficult in a practical problem. Precisely, the most 
important experimental problem related to the analysis of time series of gene 
expression data by using Microarrays/GeneChips technologies is to identify 
significant correlations between observables of genes. The criteria that we 
may use for considering that two genes are significantly correlated is as fol- 
lows: (1) By using a given time-series set of gene expression experimental 
data, our theory can be used to calculate the correlation value p y ' for each 
couple of genes. In the case that this value is in the vicinity of the value 
one (p y ~ 1) for two genes, we may consider that both genes are significant 
enough correlated. 

In addition, the following criterias should also be taken into account. (2) 
The correlation p 2 - 7 between genes in our approach is obtained by inserting 
the ITP T e (y,x) into the model. This ITP T e (y,x) is obtained by using 
experimental techniques as Microarry/GeneChips. Currently, these tech- 
nologies have a non-zero instrumental noise that in some cases may exceed 
30%. Therefore, it is important that this source of noise is reduced as much 
as possible for each experiment in order to evaluate with more accuracy the 
correlation between genes. Finally, (3) our theory for predicting the correla- 
tion phenomena is based on a stochastic approach. As we explained through 
the text, and in more extension in the next section, many experiments for 
analysing gene expression time series are encouraged to have a enough statis- 
tics to achieve a precise interpretation of the dynamics of gene correlation. 
Therefore, by increasing the number of the experiments, the confidence of 
the correlation observable p y predicted by our theory would be improved. 

3 Experimental proposal 

The most important factor in our model is the initial data of ITP T e (y, x), 
which characterizes the gene correlation system. However, as far as we know, 
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we do not have enough experimental data for completely determining the 
ITP T e (y,x). Therefore, to determine this ITP T e (y,x), we propose an 
experiment, which measures the short-time transition probability between 
the N dimensional gene expression levels x at time t and the N dimensional 
gene expression levels y at time t + e. The novelty, is that such experiments 
should be carried out at least hundred times to have enough statistics, and 
in addition, should be done under the same external condition. 

Interestingly, the stochastic nature of the fluctuations of the gene expres- 
sion level jH], strongly supports the idea of many experiments-many genes 
under the same conditions. 

If we can obtain the experimental data of T e (y,x), our model can pre- 
dict the future behavior of multi-gene correlation using our construction (for 
example, the expectation value and variance of the gene expression level at 
any time in the future) from the initial value of gene expression levels. In 
particular, it would contribute to uncover how to regulate specific genes by 
applying some external action (e.g., a patient under medical treatment). 

4 Conclusions 

We have carried out a theoretical study on gene expression correlation, which 
is one of the crucial topics of genomics in the current post-sequence era. Our 
study indicates that it is possible to analyze the dynamics underlying the gene 
expression correlation phenomena by using only one assumption the Markov 
property. In other words, it means that the multi- dimensional correlation 
dynamics of gene expression obeys the Markov property. 

Our theoretical approach of multi-gene expression dynamics indicates 
that we can specify an initial state (X to = x ) of gene expression in a cell and 
be able to predict the most relevant observables of the distribution of genes as 
expectation value, variance and covariance of multi-dimensional gene expres- 
sion level at any time in the future. This feature represents an important step 
forward in the current analysis of gene correlation analysis and have potential 
implications for genetic engineering, for example by developing personalized 
medicines according to the features of individuals. 

Furthermore, in order to achieve the above described goals we presented 
an experimental proposal. The main idea of this new proposal is that many 
experiments of many genes would be useful for completely uncover the dy- 
namics of multi-genes in cells. 
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It is also worth noticing that stochastic theory offers a huge and rich va- 
riety of tools for studying the gene expression fluctuations, and by extension 
many other cellular phenomena. For example, it is known that the jump pro- 
cesses described by Eq. (8) can represent some kind of chemical reactions. 
Therefore, as future work we may use that equation to analyze the metabolic 
pathways in cells, which are composed of chemical reactions and chemical 
compounds. 

The availability of complete genomes for several organisms has definitely 
opened new and exciting possibilities of studying the gene correlation dynam- 
ics and mechanisms. Consequently, we believe that our theoretical model, 
together with the experimental proposal, may further serve to understand the 
regulatory interactions among genes and contribute to enlighten the advances 
of the post-sequence era. 
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Figure 1: We show the experimental absolute value of gene expression level (ver- 
tical axis) vs. time (horizontal axis) of a selected group of genes which belong to 
human organism 15 . We see that the gene expression value fluctuates around the 
mean value m=6000 and m=1000. 
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Figure 2: We show the simulated results of our model for different values of p. 
Values of p are indicated in each figure, from A to E. (A) p = 1.0 indicates that 
both genes are totally positively correlated. (B) p = 0.5 indicates that both genes 
are slightly positively correlated. (C) p=0 indicates uncorrelation between genes. 
(D) p = —0.5 indicates that both genes are slightly negatively correlated. (E) 
p = —1.0 indicates that both genes are completely negatively correlated. 
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